Impossibility of deducing preferences and rationality from human policy

Authors

  • Stuart Armstrong
  • Sören Mindermann
Abstract

Inverse reinforcement learning (IRL) attempts to infer human rewards or preferences from observed behavior. However, human planning systematically deviates from rationality. Although some IRL work assumes that humans are noisily rational, there has been little analysis of the general problem of inferring the reward of a human of unknown rationality. The observed behavior can, in principle, be decomposed into two components: a reward function and a planning algorithm that maps the reward function to a policy. Both of these components have to be inferred from behavior. This paper presents a No Free Lunch theorem in this area, showing that, without making ‘normative’ assumptions beyond the data, nothing about the human reward function can be deduced from human behavior. Unlike most No Free Lunch theorems, this one cannot be alleviated by regularising with simplicity assumptions: we show that the simplest hypotheses which explain the data are generally degenerate. The paper then sketches how one might begin to use normative assumptions, without which the general IRL problem cannot be solved, to get around the problem. The reward function-planning algorithm formalism can also be used to encode what it means for an agent to manipulate or override human preferences.
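The decomposition described in the abstract can be illustrated with a toy sketch. It is not the paper's formal construction: the two-state world, the planner names, and the reward values below are all assumptions made for illustration. The sketch shows three different (planner, reward) pairs that produce exactly the same observed policy, so behavior alone cannot tell them apart.

```python
# Toy illustration (hypothetical setup, not taken from the paper):
# a policy maps each state to an action, and a planner maps a reward
# function to a policy.
STATES = ["s0", "s1"]
ACTIONS = ["a", "b"]

def rational_planner(reward):
    """Choose the highest-reward action in every state."""
    return {s: max(ACTIONS, key=lambda a: reward[(s, a)]) for s in STATES}

def anti_rational_planner(reward):
    """Choose the lowest-reward action in every state."""
    return {s: min(ACTIONS, key=lambda a: reward[(s, a)]) for s in STATES}

def make_indifferent_planner(policy):
    """Build a planner that ignores the reward and always returns `policy`."""
    def planner(reward):
        return dict(policy)
    return planner

# The behavior we observe.
observed_policy = {"s0": "a", "s1": "b"}

# Three candidate reward functions over (state, action) pairs.
reward = {(s, a): 1.0 if observed_policy[s] == a else 0.0
          for s in STATES for a in ACTIONS}
neg_reward = {key: -value for key, value in reward.items()}
zero_reward = {key: 0.0 for key in reward}

candidates = [
    ("rational", rational_planner, reward),
    ("anti-rational", anti_rational_planner, neg_reward),
    ("indifferent", make_indifferent_planner(observed_policy), zero_reward),
]

# Keep every (planner, reward) pair that reproduces the observed policy.
compatible = [name for name, planner, r in candidates
              if planner(r) == observed_policy]
print(compatible)
```

In this sketch all three pairs are compatible with the observed policy even though their reward functions disagree completely, which is the kind of ambiguity the abstract says can only be resolved by assumptions beyond the data.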

Similar resources

Fuzzy Rationality as a Basis for Group Decision Making

This paper deals with the group decision making problem, assuming that each individual defines his/her opinion through fuzzy binary preference relations, in parallel to the classical approach due to Prof. Arrow. In particular, it is postulated that the main reason for the discouraging impossibility theorems is neither in the domain of admissible preferences nor in the concept of solution (Social...

Arrow theorems in the fuzzy setting

Throughout this paper, our main idea is to analyze the Arrovian approach in a fuzzy context, paying attention to different extensions of the classical Arrow's model arising in mathematical Social Choice to aggregate preferences that the agents define on a set of alternatives. There is a wide set of extensions. Some of them give rise to an impossibility theorem as in the Arrovian classical mod...

The dilemma of Rationality or Providing Efficiency in Monetary Policy Making: An Application of Arrow’s

Introducing financial frictions into the model is a new contribution to monetary economics. Herein, an analytical tool arranges monetary policymaking as a two-step procedure. In the first step, an appropriate amount of money supply should be assessed; in the second step, that amount should be allocated to several sectors. The Central Bank obligates the step of assessment a...

Rationality in the Full-Information Model

We study rationality in protocol design for the full-information model, a model characterized by computationally unbounded adversaries, no private communication, and no simultaneity within rounds. Assuming that players derive some utility from the outcomes of an interaction, we wish to design protocols that are faithful: following the protocol should be an optimal strategy for every player, for...

Investigating the Rationality of Behavioral Economics in Mental Accounting by Studying Laboratory Economics

The purpose of this study is to investigate the rationality of behavioral economics in mental accounting by studying laboratory economics. Undoubtedly, economic man, whose fundamental characteristic is rationality, is the starting point for economic analysis. In conventional economics, the premise of rationality is the cornerstone and the premise of all economic theories. However, critics of ec...

Journal title:
  • CoRR

Volume: abs/1712.05812

Publication date: 2017